


TITLE OF THE INVENTION



CLONED GENES ENCODING REVERSE TRANSCRIPTASE LACKING 

RNase H ACTIVITY 



FIELD OF THE INVENTION 

The invention is in the field of recombinant genetics.






BACKGROUND OF THE INVENTION 

Both viral and cloned reverse transcriptase (RT)
contain at least two enzymatic activities, DNA polymerase
and ribonuclease H (RNase H), that reside on a
single polypeptide. Grandgenett, D.P., et al., Proc.
Natl. Acad. Sci. (USA) 70:230-234 (1973); Moelling, K.,
Virology 62:46-59 (1974); Kotewicz, M.L., et al., Gene
35:249-258 (1985); and Roth, M.J., et al., J. Biol.
Chem. 260:9326-9335 (1985). Little is known about the
structure-function relationship of these two activities,
but such knowledge would be important both in
understanding retroviral replication and in exploiting
the enzyme as a recombinant DNA tool.

In the retrovirus life cycle, the RT DNA polymerase
activity is responsible for transcribing viral RNA into
double-stranded DNA. Varmus, H. (1982), in Weiss, R.,
et al. (eds.), RNA Tumor Viruses, Cold Spring Harbor
Laboratory, pp. 410-423. The function of RNase H in
replication is less clear, but it is thought to degrade
genomic RNA during DNA synthesis to generate oligomeric
RNA primers for plus-strand DNA synthesis, and to remove
the RNA primers of both minus- and plus-strand DNA.
Omer, C.A., et al., Cell 30:797-805 (1982); Resnick, R.,
et al., J. Virol. 51:813-821 (1984); Varmus, H. (1985),
in Weiss, R., et al. (eds.), RNA Tumor Viruses, Cold
Spring Harbor Laboratory, pp. 79-80.

The temporal relationship in vivo between DNA
polymerization and RNA hydrolysis is not well defined.
Furthermore, precisely how the two enzymatic activities
are coordinated is not clear. Conditional mutations
restricted to either DNA polymerase or RNase H would be
invaluable in deciphering the events of retroviral
replication. Unfortunately, conditional viral mutations
in the RT gene invariably affect both activities. Lai,
M.H.T., et al., J. Virol. 27:823-825 (1978); Moelling,
K., et al., J. Virol. 32:370-378 (1979).

RT is used extensively in recombinant DNA tech- 
nology to synthesize cDNA from mRNA. One major problem 
with cDNA synthesis is that the RNase H activity of RT 
degrades the mRNA template during first-strand synthesis.
The mRNA poly(A)-oligo(dT) hybrid used as a primer
for first-strand cDNA synthesis is degraded by RT RNase
H. Thus, at the outset of cDNA synthesis, a competition
is established between RNase H-mediated deadenylation of
mRNA and initiation of DNA synthesis, which reduces the
yield of cDNA product. Berger, S.L., et al., Biochem.
22:2365-2373 (1983). Furthermore, in some cases, the
RNase H causes premature termination of DNA chain 
growth. Unfortunately, these events eliminate the 
potential for repeated copying of the RNA template. 






Efforts to selectively inactivate RT RNase H with
site-specific inhibitors have been unsuccessful (for
review, see Gerard, G.F. (1983), in Jacob, S.T. (ed.),
Enzymes of Nucleic Acid Synthesis and Modification, Vol.
I, DNA Enzymes, CRC Press, Inc., Boca Raton, FL, pp. 1-38).
Attempts to physically separate the active centers
of RT polymerase and RNase H activity by proteolysis
have yielded a proteolytic fragment possessing only
RNase H activity (Lai, M.H.T., et al., J. Virol. 25:652-663
(1978); Gerard, G.F., J. Virol. 26:16-28 (1978); and
Gerard, G.F., J. Virol. 37:748-754 (1981)), but no
corresponding fragment containing only polymerase
activity has been isolated.

Computer analysis of the amino acid sequences from
the putative gene products of retroviral pol genes has
revealed a 150-residue segment at the carboxyl terminus
that is homologous with the ribonuclease H of E. coli
and a section close to the amino terminus which can be
aligned with nonretroviral polymerases. Johnson, M.S.,
et al., Proc. Natl. Acad. Sci. (USA) 83:7648-7652
(1986). Based on these related amino acid sequences,
Johnson et al. suggest that ribonuclease H activity
should be situated at the carboxyl terminus, and the DNA
polymerase activity at the amino terminus.

There have been a number of reports concerning the
cloning of genes which encode RT and their expression in
hosts. Weiss et al., U.S. Patent No. 4,663,290 (1987);
Gerard, G.F., DNA 5:271-279 (1986); Kotewicz, M.L., et
al., Gene 35:249-258 (1985); Tanese, N., et al., Proc.
Natl. Acad. Sci. (USA) 82:4944-4948 (1985); and Roth,
M.J., et al., J. Biol. Chem. 260:9326-9335 (1985).

There has been no direct scientific evidence that
amino acid residues involved catalytically or
structurally in the RNase H activity of reverse
transcriptase could be altered to eliminate RNase H
activity without affecting the RNA-dependent DNA
polymerase activity of reverse transcriptase. Moreover,
there has been no report of the cloning of RT to give a
gene product without RNase activity.



SUMMARY OF THE INVENTION 

The invention relates to a gene which encodes
reverse transcriptase having DNA polymerase activity and
substantially no RNase H activity.

The invention also relates to a reverse transcrip- 
tase gene comprising the following DNA sequence: 

1078 

A0G KX2 CIA AAT MA GAA (2£F GAG CAT COG OA CAT fflG ADC TCA AAA G£G OCA GAT GET 
MET Hit lea Asn lie Glu Asp Glu His Arg leu His Glu Thr Ser Iys Glu Pro Asp Val 

1138 

TCP Cm G3G TOO ACA TOS CTG TCP GAT TIT OCT C2G QCC T33 GCG GAA ACC OOG G3C ATC 
Ser leu GLy Ser Ihr Trp leu Ser Asp Ifre Pro GLn Ala Trp Ala Glu Thr Gly GLy 

1198 

GGA CDS OCA GIT CDC CAA OCT CUT GIG ATC ATA OCT CTG AAA GGA AX ICT NX: CCC GIG 
GLy leu Ala Val Arg Gin Ala Pro leu lie lie Pro leu Iys Ala Ihr Ser Ihr Pro Val 

1258 

ICC MA AAA CAA TAG CCC ATC TCA CAA GAA GCC PGk CIG Q33 ATC j&G GX CAC A2A 0£ 
Ser lie Iys Gin Tyr Pro MET ser Gin Glu Ala Arg lea GLy lie Iys Pro His lie Gin 

1318 

AGA CTG TIG GAC CZ£ G3\ A3A CTG GIA OCC IGC C?^ IDC CDC 1G3 AAC ACG OCC CIG CCA 
Arg leu leu Asp Gin Gly lie leu Veil Pro Cys Gin Ser Pro Trp Asn Ihr Pro leu Leu 

1378 

OX GIT AAG AAA CCA OOG ACT AAT GAT IAT AOG OCT GTC CAG GAT CTG AGA ®A GIC AAC 
Pro Veil Iys Iys Pro Gly Ihr Asn Asp Tyr Arg Pro V&L Gin Asp leu Arg Glu Val Asn 

1438 

AAG C3G GIG GAA GAC ATC C?£ CDC MC GIG CDC AAC OCT T&C AAC CDC TIG AGC G33 CTC 
Iys Arg \fol Glu Asp lie His Pro Ihr Veil Pro Asn Pro Tyr Asn leu Isa Ser GLy leu

1498

OCA OX TOC CAC CA; TOG TAC ACT GIG CUT GftT TTA AAS GAT QX TIT TIC 1GC CDS 
Pro Pro Ser His (2Ln Trp Tyr Thr Val leu Asp leu lys Asp Ala Hie Ibe cys lea Arg 

1558 

CTC CAC CDC AX A3T CftG OCT CTC TIC GX TIT GA3 TOG AGA GAT CCA G&G A1G G2A ATC 
lea His Pro Thr Ser Gin Pro Lea Ite Ala Ihe GLu Trp Arg Asp Pro GLu h£T Gly lie 

1618 

TCA Q3A CAA TTG AX TG3 AX AGA CIC CCA CAG GGT TIC AAA AAC ACT CDC AOC CIG TTT 
Ser Gly Gin lea Thr Tip Thr Arg Lau Pro Gin Gly Ite lys Asn Ser Pro Thr lea Ihe 

1678 

GAT GA3 GCA CIG CAC ASA GAC CIA GCA C2*C TIC 033 ATC CftG CA2 CCA GAC TIG ATC CIG 
Asp Glu Ala lea His Arg Asp Ibu Ala Asp Hie Arg lie Gin His Pro Asp l£u He leu 

1738 

CIA TAC GIG GAT GAC TEA CIG CIG QOC GX ACT TCT GKG CEA GAC TGC CAA CAA <33T 
lisu Gin Tyr Val Asp Asp lea leu laa Ala Ala Thr Ser Glu lea Asp cys Gin Gin Gly 

1798 

AIT COS GX CTC TEA CAA AX CEA GOG AAC CTC GOG TAT OGG GX TCG GX AAG AAA GX 
Ihr Arg Ala leu lea Gin Thr lea Gly Asn lea Gly Tyr Arg Ala Ser Ala lys lys Ala 

1858 

CAA ATT TGC CA3 AAA CAG GIC AAS TAT CIG GGG TAT CUT CIA AAA ®G <XT CAS AGA TOG 
Gin lie Cys Gin lys Gin VaL lys Tyr leu Gly Tyr lea lea lys Glu Gly Gin Arg Trp 

1918 

CIG ACT GAG GX A3A AAA <3>G AIT GIG AIG GX CAG OCT ACT COG AAG AX (XT CGA CAA 
lea Thr Glu Ala Arg lys Glu Thr VSal MET Gly Gin Pro Thr Pro lys Thr Pro Arg Gin 

1978 

CEA AX TIC CEA GX AX GCA GX TTC TGT CDC CTC TOG ATC XT GX TTT GCA GAA 
leu Arg Glu Ihe Lea Gly Thr Ala Gly Phe Cys Arg laa Trp lie Pro Gly Ihe Ala Glu 

2038 

AIG GCA GX OX TTG TAC CUT CTC AX AAA AX GOG ACT CIG TTT AAT TGC GX OCA GPC 
M2T Ala Ala Pro I/su Tyr Pro Lea Thr Lys Thr Gly Thr Lsa Ihe Asn Trp Gly Pro Asp 

2098 

CAA CAA AAG GX TAT CAA GAA ATC AAG CAA OCT CUT CTA ACT GX GCA GX CIG GX TTG 
Gin Gin lys Ala Tyr Gin Glu lie lys Gin Ala leu leu Thr Ala Pro Ala Leu Gly l£ii 

2158 

OCA CTT TTG ACT AAG OX TTT GAA CIC TTT GDC G£C G£G AAG CAG GX TAC GX AAA G3T 
Pro Asp laa Thr IysProIteGlul£uRTeV^Asp GLu lys Gin Gly Tyr Ala lys Gly

2218

GIC C3A MG CAA AAA CIG OCA OCT TG3 OCT 03^ COG GIG Q3C TAC CTG IOC AAA AM CEA 
Vkl leu Thr Gin lys leu GLy Fro Trp Arg Arg Pro Veil Ala Tyr Lea Ser lys lys leu 

2278 

GAC OCA GIA GGA OCT G3S IQS CDC (XT TQC CIA CGS AIG GIA OCA GCC ATT GCC GIA CIG 
Asp Pro Veil Ala Ala Gly Trp Fto Pro Cys leu Arg MET Val Ala Ala lie Ala Val lea 

2338 

TOV AM GAT GCA Q3C AM CEA AGC MG G2A CM OCA CEA GIC ATT CTG GCC CDC CAT GCA 
Ihr lys Asp Ala Gly lys leu Ihr MT Gly Gin Pro leu Ve\l lie Lea Ala Pro His Ala 

2398 

GIA <3C GCA CEA GIC AAA CAA OCC OCC GAC CDC TOG CUT IOC AAC GCC COG AIG ACT CAC 
Val Glu Ala leu Veil Lys Gin Pro Vro tep Arg Trp lea Ser Asn Ala Arg MET Ihr His 

2458 

TAT CM GCC TIG COT TIG GM MS GM 033 GIC CM TIC GGA <XG GIG GEA GCC CIG AM 
Tyr Gin Ala Leu leu Leu Asp Ihr Asp Arg Veil Gin She Gly Pro V3l Veil Ala Leu Asn 

2512 

COG OCT PCG CTG CTC OCA CIG OCT GM GAA QGG CTG CAA CM AM TQC CUT GAT 
Pro Ala Ihr Leu Leu Pro Leu Vro Glu Glu Gly leu Gin His Asn Cys Leu Asp 

or the degenerate variants thereof.

The invention also relates to a reverse 
transcriptase gene comprising the following DNA 
sequence: 

1078 

AIG MC CEA AAT ATA G&A GAT GM CAT G3G CEA CAT MC TCA AAA OCA GAT GIT 
hET Ihr leu Asn Lie Glu Asp Glu His Arg leu His Glu Thr Ser lys GLu Pro A^> Val 

1138 

ICT CIA GGG IOC MA IGG CIG TCT GAT TIT OCT CM GCC IQG GCG <3\A MC QGG GGC AIG 
Ser leu Gly Ser Ihr Trp leu Ser Asp Ite Pro Gin Ala Trp Ala Glu Ihr Gly Gly MET 



1198 

GCA CTG OCA GIT 03C CAA OCT OCT CTG ADC AEA OCT CTG AAA GCA MC ICT MC OCC GIG 
Gly Leu Ala Val Arg Gin Ala Pro leu lie lie Pro Leu Lys Ala Thr Ser Ihr Pro Val 

1258 

IOC AEA AAA CAA TM OCC ACG TCA CAA GAA GCC MA CTG GOG ATC AM OOC CM AEA CAG 
Ser lie lys Gin iyr Pro MET ser GLn Glu Ala Arg Leu Gly lie lys Pro His lie Gin

1318

MA CTG TIG GM CM GCA ALA CIG GEA CDC IOC CM ICC CDC T3G AM ACG OCC CTG CTA 
Arg leu leu Asp Gin Gly He leu Val Pro Cys Gin Ser Pro Trp Asn Ihr Pro leu leu

1378

OOC GET AM AAA CXA G3S MT AAT GAT TOT AGS OCT GIC CM <3£? GIG AGA GPA GFC AM
Pro \^ Iys Iys Pro Gly Ihr Asn Aq? iyr Arg Pro V£il Gin Asp leu Arg GLu Val Asn 



1438 



AM COS GIG GAA GM ATC CM CDC ADC GIG OX AM OUT TOC AM CDC TIG AGC Q3S CTC 
lys Arg Val GTu Asp lie His Pro Thr Val Pro Asn Pro Tyr Asn leu Leu Ser GLy Lai 

1498 

OCA OOG TOC CM CM TQS TOC MI GIG CUT GKF TTA AM GAT GOC TTT TIC TQC CIG AGA 
Pro Pro Ser His Gin Trp Tyr lhr Val leu Asp leu Iys Asp Ala Ibe Hie cys Lai Arg 

1558 

CTC CM CDC AX A3T CM OCT CTC TDC QCC TIT GSG 1X33 AGA <3¥T OCA GM ACTG QSA ATC, 
leu His Pro lhr Ser GLn Pro leu Ete Ala Hie Glu Trp Arg Asp Pro GLu hET GLy lie 

1618 

TCA GJSA CAA TIG ACC 1GS MC MA CTC CEA CM GST TIC AAA AM AGT CDC ACC CIG TIT 
Ser GLy GLn leu lhr Trp Thr Arg leu Pro Gin GLy Hie Iys Asn Ser Pro lhr leu Hie 

1678 

GAT GM OCA CTC CM AGA GM CEA OCA GM TIC CDS ATC CM CM CEA GM TIG ATC CIG 
Asp Glu Ala lea His Arg Asp lea Ala Asp Hie Arg lie Gin His Pro Asp leu lie leu 

1738 

CIA CM TM GIG C^T GM TIA CTC CIG GCC OOC MT TCT GPG CEA GM TCC CAA CAA GST 
leu Gin Tyr Val Asp Asp leu leu leu Ala Ala Thr Ser GLu leu Asp Cys Gin Gin GLy 

1798 

MT COG GOC CTC TIA CAA ADC CEA GOG AM CTC G33 TOT CDS GDC TOG GCC AM AAA GOC 
lhr Arg Ala Lea lea Gin Thr lea Gly Asn lea GLy Tyr Arg Ala Ser Ala Iys Iys Ala 

1858 

CAA ATT TQC CM AAA CM GEC AM TOT CTC G33 TOT CUT CEA AAA GM GST CM MA TGS 
Gin lie Cys GLn Iys GLn Val Iys iyr leu Gly iyr leu lea Iys GLu Gly Gin Arg Trp 

1918 

CTC MT GM GCC AGA AAA GM ACT GEG AIG 033 CM OCT ACT CDS AM ADC OCT CCA CAA 
Lea Thr GLu Ala Arg Iys Glu lhr Val WET GLy Gin Pro Thr Pro Iys lhr Pro Arg Gin 

1978 

CEA MG GM TIC CIA G3G MS OCA G3C TIC TGT OGC CTC TOG ATC OCT G3S TTT GCA GAA 
Lea Arg Glu He leu GLy Thr Ala GLy Hie Cys Arg leu Trp lie Pro Gly Hie Ala GLu 

2038 

ATC GCA GCC OOC TIG TOC OCT CTC MC AAA ADS G3S ACT CTC TTT AAT TGS G3C CCA GM 
*ET Ala Ala Pro Leu Tyr Pro lea Thr Iys lhr GLy lhr leu Hie Asn Trp GLy Pro Asp 

2098

CAA CAA M; GX TAT CAA GAA MC Ate CAA OCT CUT Cm teT GCC CCA QX CTG TIG
Gin GLn Iys Ala iyr GLn GLu He Iys Gin Ala la) lea Thr Ala Pro Ala Leu GLy lea 

2158 

OCA GAT TIG teT Ate OCC TIT GAA CEC TIT GIC QVC Gte Ate CAG G3C Tte GCC AAA OCT 
Pro Asp Lea Thr Iys F*x> Ibe GLu leu Ehe V£lL tep GLu Iys Gin GLy T/r Ala Iys GLy 

2218 

GIC CIA ACG CAA AAA CLG G3A OCT TOG GGT GOG OCG GIG GCC Tte GIG TOC AAA Ate CEA 
Val leu Thr GLn Iys lea GLy Pro lip Arg Arg Pro V&l Ala iyr leu Ser Iys Iys leu 

2278 

GAC CCA GEA GCA OCT Q3S TGG CDC CCT TOC CIA 03S AEG GIA GCA GCC ATT GCC GEA GIG 
Asp Pro V£lL Ala Ala GLy Trp Pro Pro C^s leu Arg MST Val Ala Ala lie Ala Veil lea 

2338 

ACA Ate GKP GCA Q3C Ate CEA teC ATO GGA Cte CCA CIA GDC ATT GIG GCC <XC CAT GCA 
Thr Iys Asp Ala GLy Iys lea Ihr M2T Gly Gin Pro leu Veil lie leu Ala Pro His 70a 

2398 

GEA GteGC^CCAGrcAAA(^aOCeiX:(^ AIG teT Cte 

Val GLu Ala leu Val Iys GLn Pro Pro Asp Arg Trp Leu Ser Asn Ala Arg MET Thr His 

2458 

TAT Cte GDC TIG CUT TIG GAC teG Gte OGC GDC Cte TIC G3A CTG GIG GIA GCC CIG Ate 

Tyr GLn Ala lea lea lea Asp Thr Asp Arg Val GLn Ihe Gly Pro Val Val Ala leu Asn 

2518 

OCG OCT AGG CIG GIC CCA CIG OCT Gte GAA G3C CIG CAA Cte Ate IGC CET OT AAE IOC 
Pro Ala Thr leu leu Pro leu Pro GLu GLu Gly leu GLn His Asn Cys leu Asp Asn Ser 

2530 

GGC TEA ATT AAT 
Arg leu lie Asn 

or the degenerate variants thereof. 



The invention also relates to the vectors contain- 
ing the gene of the invention, hosts transformed with 
the vectors of the invention, and the reverse transcrip- 
tase expressed by the transformed hosts of the inven- 
tion. 

The invention also relates to a fusion protein 
comprising a polypeptide having RNA-dependent DNA 
polymerase activity and substantially no RNase H
activity and a second peptide selected from polypeptide
proteins which stabilize the fusion protein and hydro- 
phobic leader sequences. 

The invention also relates to a method of producing 
reverse transcriptase having DNA polymerase activity and 
substantially no RNase H activity, comprising culturing 
transformed hosts of the invention under conditions 
which produce reverse transcriptase, and isolating the
reverse transcriptase so produced. 

The invention also relates to a method of preparing 
cDNA from mRNA comprising contacting mRNA with a 
polypeptide having RNA-dependent DNA polymerase activity 
and substantially no RNase H activity, and isolating the 
cDNA so produced. 

The invention also relates to a kit for the 
preparation of cDNA from mRNA comprising a carrier being 
compartmentalized to receive in close confinement 
therein one or more containers, wherein 

(a) a first container contains reverse 
transcriptase having DNA polymerase activity 
and substantially no RNase H activity; 

(b) a second container contains a buffer and the 
nucleoside triphosphates; 

(c) a third container contains oligo(dT) primer;
and 

(d) a fourth container contains control RNA. 

The invention is related to the discovery that 
portions of the RT gene can be deleted to give a 
deletion mutant having DNA polymerase activity but no 
detectable RNase H activity. This purified mutant RT 
lacking RNase H activity can be used to effectively 
synthesize cDNA from mRNA.






DESCRIPTION OF THE FIGURES 

Figure 1. This figure depicts the restriction map
of plasmid pRT601. The M-MLV RT gene extends from 
position 1,019 to 3,070. 

Figure 2. This figure depicts a schematic representation
of pRT601 and related plasmids, and the enzymatic
activities and predicted structure of the M-MLV RT 
protein coded by each plasmid. 

Figure 3. This figure depicts an SDS-polyacrylamide
gel of M-MLV RT. pRTdEcoRV-C RT (A) and pRT601 RT
(B) (3 µg of each) were run on an SDS 10% polyacrylamide
gel (Laemmli, U.K., Nature 227:680-685 (1970)). The gel
was stained with Coomassie blue. Lane M contained Mr
standards.

Figure 4. This figure depicts an autoradiogram of
32P-labeled cDNA synthesized from 6.2 kb RNA (Materials
and Methods) by pRTdEcoRV-C RT (A) or pRT601 RT (B). A
1 kb ladder was used as a standard (C). Electrophoresis
was performed on an alkaline 1.4% agarose gel (McDonnel,
M.W., et al., J. Mol. Biol. 110:119-146 (1977)).

Figure 5. This figure depicts an autoradiogram of
32P-labeled 2.3 kb poly(A)-tailed RNA after oligo(dT)-
primed cDNA synthesis catalyzed by pRTdEcoRV-C RT or
pRT601 RT. Aliquots were removed from reaction mixtures
containing no enzyme (-E) or 200 units of RT at the
times indicated (in min). The minus enzyme control was
incubated for 60 min. Samples were electrophoresed as
described in Materials and Methods. A 1 kb ladder was
used as marker (M).

Figure 6. This figure depicts the DNA sequence
which encodes reverse transcriptase having DNA polymer- 
ase activity and substantially no RNase H activity. 
Also shown is the corresponding amino acid sequence.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

The invention relates to the production of reverse 
transcriptase having DNA polymerase activity and 
substantially no RNase H activity, using recombinant DNA 
techniques. 

Recombinant plasmids constructed as described
herein provide reverse transcriptase for use in recombinant
DNA technology to synthesize cDNA from mRNA
without the problem associated with RNase H activity
which degrades mRNA template during first-strand
synthesis.

By the term "substantially no RNase H activity" is
intended reverse transcriptase purified to near homogeneity
and having an RNase H activity of less than
0.001 pmoles [3H](A)n solubilized per µg protein with a
[3H](A)n·(dT)n substrate in which the [3H](A)n has a
specific radioactivity of 2,200 cpm/pmole. RNase H
activities of this specific activity or less allow the
preparation of cDNA without significant degradation of
the mRNA template during first-strand synthesis.
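
As a worked illustration of the threshold defined above, the following minimal sketch converts solubilized counts into pmoles of [3H](A)n per µg of protein, using the stated specific radioactivity of 2,200 cpm/pmole. The function names and any sample readings passed to them are hypothetical and are not taken from the specification.

```python
# Illustrative sketch only; names and inputs are assumptions, not assay data.
SPECIFIC_RADIOACTIVITY_CPM_PER_PMOL = 2200.0  # stated for the [3H](A)n substrate
THRESHOLD_PMOL_PER_UG = 0.001                 # "substantially no RNase H activity"


def rnase_h_activity(cpm_solubilized: float, protein_ug: float) -> float:
    """Return pmoles of [3H](A)n solubilized per microgram of protein."""
    if protein_ug <= 0:
        raise ValueError("protein_ug must be positive")
    pmol_solubilized = cpm_solubilized / SPECIFIC_RADIOACTIVITY_CPM_PER_PMOL
    return pmol_solubilized / protein_ug


def substantially_no_rnase_h(cpm_solubilized: float, protein_ug: float) -> bool:
    """True when the activity falls below the 0.001 pmol per microgram limit."""
    return rnase_h_activity(cpm_solubilized, protein_ug) < THRESHOLD_PMOL_PER_UG
```

At this specific radioactivity the limit corresponds to only 2.2 cpm solubilized per µg of protein, which suggests why a substantial amount of purified enzyme would be needed for a measurable reading.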

By the term "degenerate variants" is intended
cloned genes having variations of DNA sequence, but 
which encode the same amino acid sequence. 
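
The definition above can be illustrated with a short sketch that translates two DNA sequences using the standard genetic code and reports whether they differ in sequence yet encode the same amino acids. The helper names and the short test sequences are hypothetical examples chosen for illustration; they are not the sequences of Figure 6.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (last base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence into one-letter amino acid codes."""
    dna = dna.upper()
    if len(dna) % 3:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def are_degenerate_variants(seq_a: str, seq_b: str) -> bool:
    """True when the DNA sequences differ but encode identical amino acids."""
    return seq_a.upper() != seq_b.upper() and translate(seq_a) == translate(seq_b)
```

For instance, the hypothetical sequences ATGACCCTAAAT and ATGACTTTGAAC both translate to Met-Thr-Leu-Asn and would count as degenerate variants of one another under this definition.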

The reverse transcriptase gene (or the genetic 
information contained therein) can be obtained from a 
number of different sources. For instance, the gene may 
be obtained from eukaryotic cells which are infected 
with retrovirus, or from a number of plasmids which 
contain either a portion of or the entire retrovirus 
genome. In addition, messenger RNA-like RNA which 
contains the RT gene can be obtained from retroviruses. 
Examples of sources for RT include, but are not limited
to, Moloney murine leukemia virus (M-MLV); human T-cell
leukemia virus type I (HTLV-I); bovine leukemia virus
(BLV); Rous Sarcoma Virus (RSV); human immunodeficiency
virus (HIV); yeast, including Saccharomyces, Neurospora,
Drosophila; primates; and rodents. See, for example,
Weiss et al., U.S. Patent No. 4,663,290 (1987); Gerard,
G.F., DNA 5:271-279 (1986); Kotewicz, M.L., et al., Gene
35:249-258 (1985); Tanese, N., et al., Proc. Natl. Acad.
Sci. (USA) 82:4944-4948 (1985); Roth, M.J., et al., J.
Biol. Chem. 260:9326-9335 (1985); Michel, F., et al.,
Nature 316:641-643 (1985); Akins, R.A., et al., Cell
42:505-516 (1986), EMBO J. 4:1267-1275 (1985); and
Fawcett, D.F., Cell 47:1007-1015 (1986).

RT proviral DNA can be isolated using standard
isolation techniques. The DNA is cleaved into linear
fragments, any one of which may contain the genes which
encode RT. Such fragmentation can be achieved using
enzymes which digest or cleave DNA, such as restriction
enzymes which cleave DNA at specific base sequences.
After the linear DNA fragments are generated, they are
separated according to size by standard techniques.
Such recombinant DNA techniques may be performed as
described by Maniatis, T., et al., Molecular Cloning: A
Laboratory Manual, Cold Spring Harbor Laboratory, Cold
Spring Harbor, New York (1982).
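
The cleavage and size-separation steps described above can be sketched computationally as follows. This is a simplified illustration that assumes a linear DNA molecule and a single enzyme; the function name and the toy sequence in the usage note are hypothetical. The EcoRV recognition site (GATATC, cut after GAT to leave blunt ends) is used only as a familiar example.

```python
def digest_linear(seq: str, site: str, cut_offset: int) -> list[int]:
    """Return fragment lengths, largest first, from cutting a linear sequence.

    site is the recognition sequence; cut_offset is the number of bases
    into the site at which the top strand is cleaved.
    """
    seq, site = seq.upper(), site.upper()
    if not 0 <= cut_offset <= len(site):
        raise ValueError("cut_offset must fall within the recognition site")
    cuts = []
    start = seq.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = seq.find(site, start + 1)
    # Fragment boundaries are the two ends of the molecule plus every cut.
    bounds = [0] + cuts + [len(seq)]
    sizes = [bounds[i + 1] - bounds[i] for i in range(len(bounds) - 1)]
    return sorted(sizes, reverse=True)
```

Calling digest_linear on a toy 26-base sequence with two GATATC sites yields three fragment sizes, ordered as they would be read from the top of a gel downward.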

Identification of the DNA fragment which contains
the gene may be accomplished in a number of ways. For
example, it is possible to sequence the DNA fragments
(Maxam and Gilbert, Methods in Enzymology 64:499 (1980);
Messing, J., Meth. in Enz. 101C:20 (1983)) to identify
which fragment contains the reverse transcriptase gene.
Alternatively, hybridization techniques (Southern, J.
Mol. Biol. 98:503 (1975)) using a labeled (e.g.,
radioactively labeled) DNA probe may be employed.

The fractions containing the desired DNA are 
pooled, ligated into a suitable vector, and used to 
transform a host. Screening for transformed hosts 
containing the RT gene may be accomplished by, for 
example, the method disclosed by Gerard et al., Biochem.
13:1632-1641 (1974) or by Gerard et al., J. Virol.
15:785-797 (1975). Alternatively, clones containing
reverse transcriptase may be identified by hybridization 
with complementary labeled DNA. 

An alternative to isolating the reverse transcrip- 
tase gene from a retroviral proviral DNA is to make cDNA 
to the mRNA-like RNA which codes for reverse transcrip- 
tase. To this end, mRNA-like RNA coding for reverse 
transcriptase is isolated from retrovirus. By standard 
techniques, the isolated mRNA is then converted into 
cDNA using reverse transcriptase. The cDNA can then be 
inserted into a plasmid vector in a conventional manner. 

The choice of a suitable vector depends on a number 
of considerations known to one of ordinary skill in the 
art, such as the size of the fragment, nature of the 
host, number and position of restriction sites desired, 
and the selection marker or markers desired. Such
vectors may include replicon and control sequences from
species compatible with a host cell (see Maniatis et
al., supra). Expression of the RT genes may also be
placed under control of other regulatory sequences
homologous or heterologous to the host organism in its
untransformed state. For example, lactose-dependent E.
coli chromosomal DNA comprises a lactose or lac operon
which mediates lactose utilization by elaborating the
enzyme β-galactosidase. The lac control elements may be
obtained from bacteriophage lambda plac5, which is
infectious for E. coli. The lac promoter-operator
system can be induced by IPTG.

Other promoter-operator systems or portions thereof 
can be employed as well. For example, galactose, 
alkaline phosphatase, tryptophan, xylose, tac , lambda 
pL, lambda pR and the like can be used. Once the 
vector or DNA sequence containing the constructs has 
been prepared, the vectors may be introduced into an 
appropriate host. Various techniques may be employed 
such as protoplast fusion, CaCl2, calcium phosphate 
precipitation, electroporation, or other conventional 
DNA transfer techniques. The vectors may then be 
introduced into a second host by similar transfer 
methods, and also by cell to cell transfer methods such 
as conjugation. This cell-to-cell transfer may be accomplished
using known techniques which depend upon the
nature of the transferer bacterium, the recipient
bacterium, and the cloning vector used to propagate the
RT DNA. The transfer may require the use of a helper
plasmid. See, for example, Ditta, G., et al., Proc.
Natl. Acad. Sci. (USA) 77:7347-7351 (1980).

RT genes having DNA polymerase activity and
substantially no RNase H activity may be obtained by
deletion of deoxyribonucleotides at the 3' end of the
gene which encode the portion of the polypeptide having
RNase H activity. Deletions of the RT gene may be
accomplished by cutting the plasmid at selected restriction
sites within the RT gene and discarding the excised
fragment. Further deletion of consecutive
deoxyribonucleotides may be accomplished by treating the
fragment with an exonuclease. The DNA ends may then be
joined in such a way that the translation reading frame
of the gene is maintained. The plasmid thus obtained
may then be used to transform hosts which may then be
screened for altered RT activity. RT RNase H activity
may be assayed according to Gerard et al., J. Virol.
15:785-797 (1975). DNA polymerase activity may be
assayed according to Gerard et al., Biochem. 13:1632-1641
(1974). Clones having DNA polymerase activity and
substantially no RNase H activity may be used to prepare
RT with altered activity.
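
The requirement that the joined ends maintain the translation reading frame can be sketched as a simple check on the length of the deleted segment. The following is an illustrative simplification with hypothetical function names and coordinates; it models only the removal of a contiguous segment from a coding sequence, not the actual restriction sites or exonuclease treatment used.

```python
def delete_segment(gene: str, start: int, end: int) -> str:
    """Remove gene[start:end] (0-based, end exclusive) and rejoin the ends."""
    if not 0 <= start <= end <= len(gene):
        raise ValueError("deletion coordinates fall outside the gene")
    return gene[:start] + gene[end:]


def frame_is_maintained(start: int, end: int) -> bool:
    """True when the deleted length is a whole number of codons.

    Sequence downstream of the junction then stays in the original frame.
    """
    return (end - start) % 3 == 0
```

A deletion whose length is not a multiple of three would shift the frame of everything downstream of the junction, so such constructs would be expected to fail the activity screen described above.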

According to these methods, the portion of the RT 
gene derived from M-MLV which encodes DNA polymerase was 
localized to about 1495 base pairs (about 1018 to about 
2512) as shown in Figure 6. The protein expressed by 
this gene has about 503 amino acids (Figure 6). This 
protein has DNA polymerase activity and substantially no 
RNase H activity. 

The invention also relates to fusion proteins which 
comprise the reverse transcriptase of the invention. 
Such fusion proteins may comprise, for example, a
carrier protein which has a leader sequence of hydro- 
phobic amino acids at the amino terminus of the reverse 
transcriptase. This carrier protein is normally 
excreted through the membrane of the cell within which 
it is made. By cleavage of the hydrophobic leader 
sequence during excretion, a means is provided for 
producing reverse transcriptase which can be recovered 
either from the periplasmic space or the medium in which 
the bacterium is grown. The use of such a carrier 
protein allows isolation of reverse transcriptase 
without contamination by other proteins within the 
bacterium, and achieves production of a form of reverse 
transcriptase having greater stability by avoiding the 
enzymes within the bacterial cell which degrade foreign
proteins. The DNA and amino acid sequences for such
hydrophobic leader sequences, as well as methods of
preparing such fusion proteins, are taught, for example,
by Gilbert et al. , U.S. Patent 4,411,994 (1983). 

It is also possible to prepare fusion proteins
comprising the reverse transcriptase of the invention
which is substituted at the amino or carboxy termini
with polypeptides which stabilize or change the solubility
of the reverse transcriptase. An amino-terminal
gene fusion which encodes reverse transcriptase, having
both DNA polymerase and RNase activity, and trpE is taught,
for example, by Tanese, N., et al., Proc. Natl. Acad.
Sci. 82:4944-4948 (1985). A carboxy-terminal gene
fusion which encodes reverse transcriptase and part of
the plasmid pBR322 tet gene is taught, for example, by
Kotewicz, M., et al., Gene 35:249-258 (1985) and Gerard,
G., DNA 5:271-279 (1986).

The transformed hosts of the invention may be 
cultured under protein producing conditions according to 
any of the methods which are known to those skilled in 
the art. 

The reverse transcriptase having DNA polymerase 
activity and substantially no RNase activity may be 
isolated according to conventional methods known to 
those skilled in the art. For example, the cells may be 
collected by centrifugation, washed with suitable 
buffers, lysed, and the reverse transcriptase isolated 
by column chromatography, for example, on DEAE-cellulose,
phosphocellulose (see Kotewicz et al., Gene
35:249-258 (1985)) or other standard isolation and
identification techniques using, for example,
polyribocytidylic acid-agarose, or hydroxylapatite or
by electrophoresis or immunoprecipitation.

The reverse transcriptase so produced may be used
to prepare cDNA from RNA by, for example, hybridizing an
oligo(dT) primer or other complementary primers with the
mRNA. The synthesis of a complete cDNA may be accomplished
by adding the reverse transcriptase and all
four deoxynucleoside triphosphates. Using the reverse
transcriptase produced by the present invention allows
for the preparation of cDNA from mRNA without concomitant
degradation of the mRNA which results in incomplete
cDNA synthesis. The resulting RNA-DNA hybrid may be
treated, for example, with alkali or RNase H to selectively
hydrolyze the RNA to leave cDNA which may be
converted to double-stranded form in a second DNA reaction
catalyzed by reverse transcriptase or other DNA
polymerase. See Old, R.W., et al., Principles of Gene
Manipulation, second edition, Studies in Microbiology,
Vol. 2, University of California Press, p. 26 (1981).
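
The relationship between the mRNA template and the first-strand cDNA described above can be sketched in a few lines. This is an idealized illustration that assumes full-length synthesis from the 3' end of the template; the function names and the 12-base primer length are assumptions made for the example.

```python
# RNA base -> complementary DNA base.
COMPLEMENT = {"A": "T", "U": "A", "G": "C", "C": "G"}


def first_strand_cdna(mrna: str) -> str:
    """Return first-strand cDNA (5'->3') for an mRNA written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(mrna.upper()))


def oligo_dt_can_prime(mrna: str, primer_len: int = 12) -> bool:
    """True when the 3' end carries a poly(A) run at least primer_len long.

    primer_len is an assumed value used only for this sketch.
    """
    return mrna.upper().endswith("A" * primer_len)
```

Because the cDNA begins with the T residues copied from the poly(A) tail, loss of that tail through RNase H-mediated deadenylation would remove the priming site in this simplified picture.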

The reverse transcriptase of the invention is 
ideally suited for incorporation into a kit for the 
preparation of cDNA from RNA. Such a kit may comprise a 
carrier means being compartmentalized to receive in close
confinement therein one or more container means, such
as vials, tubes, and the like, each of said container 
means comprising one of the separate elements of the 
method used to prepare cDNA from RNA. For example, 
there may be provided a container means containing 
reverse transcriptase having DNA polymerase activity and 
substantially no RNase H activity, in solution. Further 
container means may contain suitable buffers, substrates 
for DNA synthesis such as the deoxynucleoside triphosphates,
oligo(dT) primer, and control RNA for use as a
standard.

The reverse transcriptase may be present in the
solution at a concentration of 200 units/ml to 400
units/ml. The deoxynucleoside triphosphates may be
present either in lyophilized form or as part of a
buffer at a concentration of 0.5 mM to 2 mM. A suitable
buffer, present at 5 times the final concentration of
use, includes 250 mM Tris-HCl (pH 7.5 to 8.3), 375 mM
KCl, 15 mM MgCl2, and 50 mM dithiothreitol. The
oligo(dT) may be present at a concentration of 5 µg/ml to 20
µg/ml. Control RNA, such as 2.3 kb control RNA, may be
present at a concentration of 10 µg/ml to 20 µg/ml.
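
Since the buffer is supplied at 5 times its concentration of use, the working concentrations follow by simple dilution. The sketch below performs that arithmetic; the dictionary and function names are hypothetical, and the reaction volume passed in the usage note is an arbitrary example rather than a value from the specification.

```python
# Stock (5x) concentrations in mM, as listed for the kit buffer.
FIVE_X_BUFFER_MM = {
    "Tris-HCl": 250.0,
    "KCl": 375.0,
    "MgCl2": 15.0,
    "dithiothreitol": 50.0,
}


def final_concentrations(stock_mm: dict[str, float], fold: float = 5.0) -> dict[str, float]:
    """Return working concentrations (mM) after diluting the stock fold-times."""
    return {component: conc / fold for component, conc in stock_mm.items()}


def stock_volume_ul(reaction_volume_ul: float, fold: float = 5.0) -> float:
    """Volume of concentrated buffer needed for a given final reaction volume."""
    return reaction_volume_ul / fold
```

Diluted fivefold, the buffer gives 50 mM Tris-HCl, 75 mM KCl, 3 mM MgCl2, and 10 mM dithiothreitol; a hypothetical 50 µl reaction would take 10 µl of the concentrated buffer.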

The following examples are illustrative but not 
limiting of the methods and compositions of the present 
invention. Any suitable modifications and adaptations 
which are obvious to one of ordinary skill in the art in 
recombinant DNA techniques are within the spirit and 
scope of the present invention. 

Examples

MATERIALS AND METHODS
Plasmids and Bacterial Strains 

For deletion analysis of RT, a clone of M-MLV RT,
pRT601 (Figure 1), was constructed to overproduce stable
RT in Escherichia coli. Gerard, G.F., et al., DNA
5:271-279 (1986). It is a pBR322 replicon containing
the strong lambda leftward promoter, pL, and the
ribosome binding site of the lambda cII gene. (Higher
copy number derivatives of pBR322, such as pUC plasmids,
can also be used.) The coding sequence for the RT gene
was carefully engineered into this plasmid to produce a
protein with the amino terminus of the viral protein,
and a carboxy terminus similar to the viral enzyme.
Gerard, G.F., supra.

Two bacterial strains were used to propagate clones
and express RT: K802 (Maniatis, T., et al. (1982),
Molecular Cloning: A Laboratory Handbook, pp. 504-505,
Cold Spring Harbor Laboratory, Cold Spring Harbor, New
York), made lysogenic for lambda cI ind1 ts857 Sam7, and
N4830 (Gottesman, M.E., et al., J. Mol. Biol. 140:57-75
(1980)), which contains a deleted cryptic lambda
prophage expressing the temperature sensitive cI allele
ind1 ts857. Bacteria were grown in super broth (SB)
containing 2% tryptone, 1% yeast extract, 0.1% NaCl, pH
7.5, and 50 µg/ml ampicillin.

Plasmid Construction 

Standard procedures for plasmid construction were
performed as described previously (Kotewicz, M.L., et
al., Gene 35:249-258 (1985); Gerard, G.F., et al., DNA
5:271-279 (1986)).

Temperature Induction of E. coli Carrying pRT601 and its 
Derivatives 

Cultures of bacteria were grown in SB broth
overnight at 32°C and diluted 1:20 in fresh SB in the
morning. The cells were grown at 32°C until the A590
was 0.8, and were induced by swirling in a 65°C water
bath until the temperature reached 42°C. Induction was
continued for 30 minutes in a shaking water bath at
42°C, and then the cultures were incubated at 37°C and
grown an additional 30 minutes.

Preparation of Cell Extracts

Unless otherwise noted, all operations were
performed at 4°C. Pelleted cells from one ml of culture
were washed, lysed, and centrifuged as described
previously (Kotewicz, M.L., supra). Supernatants were
removed and assayed for RNase H and DNA polymerase
activity.



Enzyme Assays

RT DNA polymerase activity in extracts was assayed
specifically by using
poly(2'-O-methylcytidylate)·oligodeoxyguanylate
[(Cm)n·(dG)12-18] (Gerard, G.F., et al., DNA 5:271-279
(1986)), eliminating interference from cellular DNA
polymerases. To establish DNA polymerase specific
activities of purified RT preparations, activity was
assayed with (A)n·(dT)12-18 (Houts, G.E., et al., J.
Virol. 29:517-522 (1979)) as described by Gerard, G.F.,
et al., DNA 5:271-279 (1986). One unit of DNA
polymerase activity is the amount of enzyme that
incorporates one nmole of deoxynucleoside monophosphate
into acid insoluble product at 37°C in 10 min.

RNase H activity in crude extracts and purified
enzyme was assayed in reaction mixtures (50 µl)
containing 50 mM Tris-HCl (pH 8.3), 2 mM MnCl2, 1 mM
dithiothreitol, and [3H](A)n·(dT)n (5 µM [3H](A)n, 35
cpm/pmole; 20 µM (dT)n). Reactions were incubated at
37°C for 20 min and were stopped by adding 10 µl of tRNA
(1 mg/ml) and 20 µl of cold 50% TCA. After 10 minutes on
ice, the mixture was centrifuged for 10 minutes in an
Eppendorf centrifuge. Forty µl of the supernatant was
counted in aqueous scintillant. One unit of RNase H
activity is the amount of enzyme required to solubilize
one mole of [3H](A)n in [3H](A)n·(dT)n in 10 min at
37°C.
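
The arithmetic implied by the RNase H assay above may be
sketched as follows. This is a minimal illustration, not a
validated procedure. It assumes that the counted cpm are
corrected only by a simple background subtraction, that the
40 µl counted is representative of the whole 80 µl stopped
mixture (50 µl reaction plus 10 µl tRNA plus 20 µl TCA), and
that solubilization is linear over the 20 min incubation. All
function names are hypothetical.

```python
# Illustrative sketch: convert supernatant cpm to pmol of [3H](A)n
# solubilized. Volumes, specific activity and incubation time are from
# the assay description; everything else is an assumption of the sketch.

REACTION_UL = 50.0
TRNA_UL = 10.0
TCA_UL = 20.0
COUNTED_UL = 40.0
CPM_PER_PMOL = 35.0
INCUBATION_MIN = 20.0


def pmol_solubilized(counted_cpm, background_cpm=0.0):
    """pmol of [3H](A)n rendered acid soluble in the whole reaction."""
    total_ul = REACTION_UL + TRNA_UL + TCA_UL  # 80 µl after stopping
    net_cpm = max(counted_cpm - background_cpm, 0.0)
    whole_mixture_cpm = net_cpm * total_ul / COUNTED_UL
    return whole_mixture_cpm / CPM_PER_PMOL


def pmol_per_10_min(counted_cpm, background_cpm=0.0):
    """Scale to the 10 min interval used in the unit definition,
    assuming a linear time course (an assumption of this sketch)."""
    return pmol_solubilized(counted_cpm, background_cpm) * 10.0 / INCUBATION_MIN
```

For a hypothetical reading of 350 cpm with no background,
pmol_solubilized(350.0) gives 20 pmol for the 20 min reaction,
or 10 pmol per 10 min.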

Synthesis of Poly(A)-Tailed RNA

Synthetic 2.3 kb and 6.2 kb RNAs containing a 19
nucleotide poly(A) tail at the 3' end were synthesized
with T7 RNA polymerase from Xba I-cut pJD2.3 and Hind
III-cut pHL3X, respectively. Reaction mixtures (0.3 ml)
contained 40 mM Tris-HCl (pH 8.0), 8 mM MgCl2, 2 mM
spermidine-HCl, 5 mM dithiothreitol, 0.4 mM each of CTP,
UTP, GTP, and ATP, 20 µg/ml DNA, and 2,000 units/ml T7
RNA polymerase. Uniformly labeled RNA was synthesized
with all four [α-32P]NTPs, each at 0.4 mM and 250
cpm/pmole. After 1 hr incubation at 37°C, the RNA
product was phenol extracted, ethanol precipitated, and
purified by oligo(dT)-cellulose chromatography to ensure
the presence of a poly(A) tail.

Conditions for cDNA Synthesis

When assessing the effect of cDNA synthesis upon
the integrity of template RNA, reaction mixtures (50 µl)
contained 50 mM Tris-HCl (pH 8.3), 75 mM KCl, 3 mM
MgCl2, 10 mM dithiothreitol, 0.5 mM each dATP, dGTP, and
dTTP, 0.5 mM [3H]dCTP (200 cpm/pmol), 50 µg/ml
(dT)12-18, 20 µg/ml 2.3 kb [32P]-labeled RNA, and 4,000
units/ml RT. The reactions were incubated at 37°C and
duplicate 2.5 µl aliquots were removed at 0, 1, 5, 10,
30, and 60 min. One aliquot was precipitated onto glass
fiber filters using TCA to determine the amount of cDNA
synthesized, and the other aliquot was prepared for
glyoxal gel analysis. Carmichael, G.G., et al., Method.
Enzymol. 65:380-391 (1980). The glyoxalated RNA was
fractionated on a 1% agarose gel, dried, and
autoradiographed. In some cases, 10 units of E. coli RNase H
were added to the reaction mixture after 60 min and the
incubation continued for 10 more min before aliquots
were taken.
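
One way the amount of cDNA synthesized might be estimated
from the TCA-precipitable [3H]dCMP in an aliquot is sketched
below. The sketch assumes that dCMP makes up one quarter of
the incorporated nucleotides and that the average mass of an
incorporated nucleotide is about 330 g/mol. Neither value is
stated in the text, and both are labeled as assumptions in the
code. The function name and the example cpm value are
hypothetical.

```python
# Illustrative sketch: estimate cDNA mass from [3H]dCMP incorporation.
# Reaction volume, aliquot volume and specific activity follow the
# conditions described above; the base composition and nucleotide mass
# are assumptions of this sketch.

def cdna_mass_ug(
    aliquot_cpm,
    aliquot_ul=2.5,
    reaction_ul=50.0,
    cpm_per_pmol=200.0,
    dcmp_fraction=0.25,        # ASSUMPTION: equal base composition
    nt_mass_g_per_mol=330.0,   # ASSUMPTION: average dNMP residue mass
):
    """Return the estimated cDNA mass (µg) in the whole reaction."""
    pmol_dcmp_aliquot = aliquot_cpm / cpm_per_pmol
    pmol_dcmp_total = pmol_dcmp_aliquot * reaction_ul / aliquot_ul
    pmol_nucleotide = pmol_dcmp_total / dcmp_fraction
    picograms = pmol_nucleotide * nt_mass_g_per_mol  # pmol x g/mol = pg
    return picograms * 1e-6                          # pg -> µg
```

Under these assumptions, a hypothetical aliquot reading of
5,000 cpm corresponds to about 0.66 µg of cDNA in the 50 µl
reaction, which contains 1 µg of template RNA at 20 µg/ml.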

When measuring the ability of RT to synthesize a
cDNA copy of long RNA, reaction mixtures (10 µl)
contained the same buffer and salts, 0.5 mM each of
dATP, dGTP, dTTP, and [α-32P]dCTP (600 cpm/pmole), 50
µg/ml actinomycin D, 50 µg/ml (dT)12-18, 100 µg/ml 6.2
kb poly(A)-tailed RNA, and 20,000 units/ml RT. After 1
hr at 37°C, the product in an aliquot (1 µl) was
precipitated with TCA and counted, and the remaining DNA
was size fractionated on an alkaline 1.4% agarose gel
according to McDonnel, M.W., et al., J. Mol. Biol.
110:119-146 (1977).

Purification of RT

Cells were grown to an 
ampicillin medium (Gerard, G.F., et al., DNA 5:271-279
(1986)) at 30°C, induced at 43°C for 45 min, and then
grown at 36°C for 3.5 hr before harvesting. RT was
extracted from 100 g of cells as described (Gerard,
G.F., supra) with the following exceptions. RT was
precipitated by addition of solid (NH4)2SO4 to 40%
saturation. The (NH4)2SO4 pellet was dissolved in 50 ml
of 20 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.1 M NaCl, 5%
glycerol, 1 mM dithiothreitol, and 0.01%
n-octyl-β-D-glucopyranoside, the suspension was clarified by
centrifugation at 10,000 x g for 10 min, and the
supernatant was desalted on a 320 ml (5 x 16 cm)
Sephadex G-25 column run in buffer A (20 mM Tris-HCl, pH
7.5, 1 mM dithiothreitol, 1 mM EDTA, 5% glycerol, 0.01%
NP-40) plus 0.1 M NaCl. After phosphocellulose
chromatography, the RT peak was pooled, diluted with an equal
volume of buffer A, and chromatographed on a 21-ml
heparin-agarose column (1.5 x 12 cm) equilibrated in
buffer A plus 0.1 M NaCl. The RT peak from the
heparin-agarose column was chromatographed on a Mono-S HR 5/5
column equilibrated in buffer A (Gerard, G.F., supra).

RESULTS

Construction of Reverse Transcriptase Gene Deletions 

Deletions of the M-MLV RT gene were constructed by 
cutting pRT601 (Figure 1) at selected restriction sites 
within the RT gene, discarding the excised fragment, and 
joining the DNA ends in such a way that the translation 
reading frame of the gene was maintained. pRTdBam-Bam 
was constructed by deleting the Bam HI fragment between 
nucleotide positions 1,654 and 1,960 (Figures 1 and 2). 
Ligation of the Bam HI half sites at positions 1,654 and 
1,960 maintained the translation reading frame across 
the site. 
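
The requirement that a deletion preserve the translation
reading frame can be checked by simple arithmetic, as in the
following sketch. It assumes that the distance between the two
stated nucleotide positions equals the number of base pairs
removed. The function name is hypothetical.

```python
# Illustrative sketch: test whether a deletion between two nucleotide
# positions removes a whole number of codons.

def in_frame_deletion(start_nt, end_nt):
    """Return (is_in_frame, codons_removed) for a deletion that removes
    the base pairs between two equivalent cut positions."""
    deleted_bp = end_nt - start_nt
    return deleted_bp % 3 == 0, deleted_bp // 3
```

For the Bam HI sites at positions 1,654 and 1,960,
in_frame_deletion(1654, 1960) reports an in-frame deletion of
306 base pairs, or 102 codons.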

A deletion at the carboxy terminus of M-MLV RT
(pRTdEcoRV-C) was constructed by deleting all of the 3'
end of the gene downstream of the Eco RV site at
position 2,513 (Figures 1 and 2). To construct
pRTdEcoRV-C, a Sca I (position 6,238) to Eco RV
(position 2,513) fragment of pRT601 containing the 5'
portion of the RT gene was ligated to a Sca I-Eco RI
fragment derived from plasmid pBRT (Gerard, G.F.,
supra). The 3,211 base pair pBRT Sca I-Eco RI fragment
contained the pBR322 origin of replication and a
universal translation terminator sequence just inside
the Eco RI site. The Eco RI site was repaired with DNA
polymerase I Klenow fragment before ligation.

The plasmid pRTdEcoRV-C was deposited in E. coli
under the terms of the Budapest Treaty at the American
Type Culture Collection (ATCC), Rockville, Maryland, and
given accession number 67555.

pRT603, constructed as described (Gerard, G.F.,
supra), encodes an RT that contains 73 fewer amino
acids than pRT601, all deleted from the carboxy terminus
(Figure 2).






DNA Polymerase and RNase H Levels in Cells Bearing 
Deletion Plasmids 

Alteration of as little as 3 amino acids at the
carboxy end of M-MLV RT can influence markedly the
stability of the protein in E. coli (Gerard, G.F.,
supra). This must be taken into consideration in making
correlations between cloned RT deletions and enzymatic
activities in E. coli extracts. Both DNA polymerase and
RNase H activity must be assayed and relative enzyme
levels compared. For example, pRT603 codes for an RT
with 73 fewer amino acids at the carboxy terminus than
pRT601 RT (Gerard, G.F., supra; Figure 2). The level of
DNA polymerase activity in E. coli extracts of pRT603 RT
is reduced 5 fold relative to pRT601 (Gerard, G.F.,
supra). However, the DNA polymerase and RNase H
specific activities of purified pRT601 and pRT603 RT are
comparable (Table 2). The reduced DNA polymerase
activity in E. coli extracts of pRT603 RT is not due to
a selective effect of the deletion on DNA polymerase
activity, but rather to a reduction in the stability of
pRT603 RT relative to pRT601 RT in cells (t1/2 of 7 min
versus 33 min) (Gerard, G.F., supra). Therefore,
deletions within 70 amino acids of the RT carboxy
terminus do not affect either RNase H or DNA polymerase
activity.
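
The connection between the shorter half-life and the lower
extract activity can be illustrated with a simple model. The
sketch below assumes constant synthesis and first-order decay,
so that the steady-state protein level is proportional to the
half-life. This model and the function names are assumptions
of the sketch and are not taken from the cited work.

```python
# Illustrative sketch: steady-state level of a protein made at a
# constant rate and lost by first-order decay (an ASSUMED model).
import math


def steady_state_level(synthesis_rate, half_life_min):
    """Level at which synthesis balances decay: rate / k, with
    k = ln(2) / half-life."""
    decay_constant = math.log(2) / half_life_min
    return synthesis_rate / decay_constant


def fold_difference(half_life_a_min, half_life_b_min, synthesis_rate=1.0):
    """Ratio of steady-state levels for equal synthesis rates."""
    return (steady_state_level(synthesis_rate, half_life_a_min)
            / steady_state_level(synthesis_rate, half_life_b_min))
```

With half-lives of 33 min and 7 min, fold_difference(33.0, 7.0)
is about 4.7, which is of the same order as the 5 fold
reduction in extract activity noted above.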

In contrast, the DNA polymerase activity of
pRTdBam-Bam RT was eliminated totally without affecting
RNase H activity (Table 1) by the deletion of 102 amino
acid residues between amino acids 212 and 314 (Figure
2). Introduction of a more extensive deletion of 180
amino acids at the carboxy end of RT in pRTdEcoRV-C RT
(Figure 2) yielded extracts with RT DNA polymerase
levels unchanged compared to pRT601 extracts, but with
RNase H levels reduced 7.5 fold (Table 1). The residual
RNase H activity in pRTdEcoRV-C extracts could be due to
E. coli RNase H, the 5'→3' exonuclease of DNA
polymerase I, or a small amount of residual RT-coded
RNase H activity. To resolve this issue, pRTdEcoRV-C RT
was purified and compared to RT encoded by pRT601.
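
The relative levels cited above can be recomputed from the
values in Table 1, as in the following sketch. The values are
entered as they appear in the table and are used only as
ratios. The dictionary and function names are hypothetical.

```python
# Illustrative sketch: express Table 1 extract activities relative to
# the parent plasmid pRT601. Values are (DNA polymerase, RNase H).

TABLE_1 = {
    "pRT601": (10977.0, 2020.0),
    "pRTdBam-Bam": (179.0, 1564.0),
    "pRTdEcoRV-C": (10038.0, 268.0),
}


def relative_to_parent(plasmid, parent="pRT601"):
    """Return (polymerase fraction, RNase H fraction) versus the parent."""
    pol, rnase_h = TABLE_1[plasmid]
    parent_pol, parent_rnase_h = TABLE_1[parent]
    return pol / parent_pol, rnase_h / parent_rnase_h


def rnase_h_fold_reduction(plasmid, parent="pRT601"):
    """Fold reduction in RNase H level versus the parent."""
    return 1.0 / relative_to_parent(plasmid, parent)[1]
```

On these figures, pRTdEcoRV-C extracts retain about 91% of the
pRT601 DNA polymerase level with RNase H reduced about 7.5
fold, whereas pRTdBam-Bam extracts retain under 2% of the DNA
polymerase level and about 77% of the RNase H level.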

Purification and Properties of pRTdEcoRV-C RT 

M-MLV reverse transcriptases encoded by pRTdEcoRV-C,
pRT601, and pRT603 were purified as described in
Materials and Methods. A summary of the purification of
pRTdEcoRV-C RT is presented in Table 3. Three column
steps produced a nearly homogeneous mutant enzyme with
the same DNA polymerase specific activity as pRT601 RT
with the template-primer (Cm)n·(dG)12-18 (Table 2).
With (A)n·(dT)12-18, the mutant enzyme had one-fourth
the DNA polymerase activity of pRT601 RT (Table 2).
RNase H activity of purified pRTdEcoRV-C RT was
undetectable using [3H](A)n·(dT)n as the substrate. Most
RNase H activity in extracts was eliminated from mutant
RT by precipitation of the enzyme with 40% (NH4)2SO4
(Table 3). Under these conditions, DNA polymerase I
remains soluble (Richardson, C., et al., J. Biol. Chem.
239:222-230 (1964)), as does most of the RNase H
activity in the extract. As judged by
SDS-polyacrylamide gel electrophoresis, pRTdEcoRV-C RT
purified through the Mono-S column was greater than 90%
pure and had a molecular weight of 56,000 (Figure 3),
consistent with the molecular weight (57,000) predicted
by the DNA sequence.

A number of enzymatic properties of purified
pRTdEcoRV-C RT and pRT601 RT were compared and were
found to be similar. These included half life at 37°C,
monovalent and divalent metal ion optima, fidelity of
dNTP incorporation with homopolymer templates, and
insensitivity to stimulation by polyanions. The
abilities of the two enzymes to synthesize
heteropolymeric DNA were also compared. Figure 4 shows
that pRTdEcoRV-C RT catalyzed the synthesis of
full-length cDNA from 6.2 kb RNA more efficiently than
pRT601 RT. The amount of cDNA synthesized from 1 µg of
RNA was 0.28 µg (34% full-length) and 0.24 µg (24%
full-length) with pRTdEcoRV-C RT and pRT601 RT,
respectively.

To confirm that pRTdEcoRV-C RT completely lacked
RNase H activity, the integrity of a uniformly
32P-labeled RNA template after conversion to hybrid form
during RT-catalyzed DNA synthesis was examined. Figure
5 shows that with pRT601 RT, the full-length 2.3 kb RNA
template was degraded totally after 5 min of synthesis.
In contrast, with pRTdEcoRV-C RT the RNA was intact even
after 60 min. The amount of cDNA synthesized after 60
min from 1 µg of RNA was 0.67 and 0.76 µg with pRT601
and pRTdEcoRV-C RT, respectively. When 10 units of E.
coli RNase H were added to the pRTdEcoRV-C RT reaction
after 60 min of incubation, all of the RNA was degraded,
confirming the hybrid state of the RNA. In addition, 15
µg (1,200 units) of pRTdEcoRV-C RT solubilized no
radioactivity from a [3H](A)n·(dT)n substrate in which
the [3H](A)n had a specific activity of 2,200 cpm/pmole
(Materials and Methods).
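
The enzyme amount quoted above, 15 µg corresponding to 1,200
units, implies a specific activity that can be computed as in
the following sketch. The function name is hypothetical, and
the units-per-milligram scale is a convention chosen for this
illustration.

```python
# Illustrative sketch: specific activity from a quoted number of units
# and the corresponding mass of enzyme.

def units_per_mg(units, mass_ug):
    """Return specific activity in units per mg of protein."""
    return units / mass_ug * 1000.0
```

Here units_per_mg(1200.0, 15.0) gives 80,000 units per mg.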

Experiments with a frameshift mutant of MLV
producing a 47K RT molecule truncated at the carboxy
terminus (Levin, J.G., et al., J. Virol. 51:470-478
(1984)) and with antibodies to synthetic peptides
modeled to Rous sarcoma virus pol gene sequences
(Grandgenett, D., et al., J. Biol. Chem. 260:8243-8249
(1985)) suggest the RNase H activity of RT resides
within the amino-terminal portion of the molecule.
Conversely, the extensive homology found between the
amino acids of E. coli RNase H and the 153-residue
segment at the carboxy-terminal end of M-MLV RT
(Johnson, M.S., et al., Proc. Natl. Acad. Sci. (USA)
83:7648-7652 (1986)) suggests the RNase H activity
resides within the carboxy-terminal portion of RT.

By deleting large segments (100 to 200 codons) of
the M-MLV RT gene, the regions within the RT molecule
responsible for DNA polymerase and RNase H activity have
been identified. DNA polymerase was mapped to the amino
half of the molecule, and RNase H to within 200 amino
acids of the carboxy end, confirming the predictions
based upon amino acid homology (Johnson, M.S., et al.,
supra). In this context, the results with one RT clone,
pRT603 (Figure 2), are of interest. The RT protein
encoded by pRT603 is missing the carboxy half of the 153
amino acid segment of RT homologous to E. coli RNase H,
which includes 20 of 48 homologous amino acids. Yet,
pRT603 RT has normal levels of RNase H activity. These
missing, homologous residues apparently are not required
for catalysis, and might serve a nucleic acid binding or
structural role. Consistent with the latter, a single
amino acid change at a position 12 residues from the
carboxy end of E. coli RNase H produces a 10-fold
reduction in RNase H specific activity (Kanaya, S., et
al., J. Bacteriol. 154:1021-1026 (1983)). This
reduction appears to be the result of altered protein
conformation (Kanaya, S., et al., supra).

If the RT polymerase and nuclease active sites
reside on separate structural domains, it should be
possible theoretically to isolate two separate protein
fragments, each with a single activity. A 24K to 30K
proteolytic fragment of RT possessing only RNase H
activity has been isolated (Lai, M.H.T., et al., J.
Virol. 25:652-663 (1978); Gerard, G.F., J. Virol.
26:16-28 (1978); Gerard, G.F., J. Virol. 37:748-754
(1981)), but unfortunately, the location of this RNase H
fragment in the parent RT polypeptide has not been
established, and no analogous DNA polymerase containing
fragment has ever been found. The results presented
here show that of the 684 amino acids in pRT601 RT,
residues between amino acid 212 and 314 are required for
DNA polymerase activity, and residues between amino acid
503 and 611 are required for RNase H activity. They
also demonstrate for the first time that the RT DNA
polymerase activity can exist independently of RNase H
activity on an RT protein fragment. Purified
pRTdEcoRV-C RT appeared to be totally devoid of RNase H
activity, based upon two sensitive assays, and to have
full DNA polymerase activity. However, these results do
not rule out the possibility that the two active centers
share a portion(s) of the RT molecule.

Demonstration of a separate structural domain for
the RNase H active center was attempted by constructing
two amino-terminal deletion derivatives of pRT601. The
first derivative contained sequences from the Eco RV
site at position 2513 to the 3' end of the RT gene (see
Figure 2), and the second contained sequences from an
Nco I site at position 2302 to the 3' end of the RT
gene. Unfortunately, neither clone produced detectable
RNase H activity in E. coli crude extracts. Such
negative results are difficult to interpret because the
proteins might be unable to fold in an active form, or
might be extremely labile.

Deletion of the carboxy-terminal one-fourth of the
M-MLV RT molecule did not disrupt the ability of the
protein to fold in an active conformation. pRTdEcoRV-C
RT copied heteropolymeric RNA more efficiently than
intact RT. Yields of cDNA from 1 µg of 2.3 kb and 6.2
kb RNA were 0.76 µg (50% full-length) and 0.28 µg (34%
full-length), respectively. Also, the truncated and
intact enzymes had the same DNA polymerase specific
activity with (Cm)n·(dG)12-18. However, the truncated
enzyme copied (A)n·(dT)12-18 only one fourth as
efficiently as the parent RT. The origin of this
difference has not yet been established.

derivatives.

Plasmid          DNA polymerase     RNase H
                 Activity a         Activity b

pRT601           10,977             2,020
pRTdBam-Bam      179                1,564
pRTdEcoRV-C      10,038             268

a Reverse transcriptase was assayed with
(Cm)n·(dG)12-18 (Materials and Methods).

b RNase H activity was assayed with [3H](A)n·(dT)n (Materials and
Methods).

Table 2. Comparison of activities of purified RT coded by pRT601,
pRT603, and pRTdEcoRV-C

Plasmid          DNA polymerase activity with          RNase H
                 (Cm)n·(dG)12-18   (A)n·(dT)12-18      activity

pRT601           21,700            350,000             2,670
pRT603           ND a              230,000             1,100
pRTdEcoRV-C      17,500            81,000              - b

a ND, not determined

b No activity was detected